Ola-7B is a multi-modal language model jointly developed by Tencent, Tsinghua University, and Nanyang Technological University. Based on the Qwen2.5 architecture, it supports text, image, video, and audio inputs, with text content as output.
Safetensors Supports Multiple Languages